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REARRANGING  EDGEWORTH-CORNISH-FISHER  EXPANSIONS 

VICTOR  CHERNOZHUKOV+     IVAN  FERNANDEZ- VAL§     ALFRED  GALICHON^ 


Abstract.  This  paper  applies  a  regularization  procedure  called  increasing  rearrange- 
ment to  monotonize  Edgeworth  and  Cornish-Fisher  expansions  and  any  other  related 
approximations  of  distribution  and  quantile  functions  of  sample  statistics.  Besides  sat- 
isfying the  logical  monotonicity,  required  of  distribution  and  quantile  functions,  the  pro- 
cedure often  delivers  strikingly  better  approximations  to  the  distribution  and  quantile 
functions  of  the  sample  mean  than  the  original  Edgeworth-Cornish-Fisher  expansions. 
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2  improving  edgeworth  approximations 

1.  Introduction 

Higher  order  approximations  to  the  distribution  of  sample  statistics  beyond  the  or- 
der n"^/^  provided  by  the  central  limit  theorem  are  of  central  interest  in  the  theory 
of  asymptotic  statistics,  see  e.g.  (Blinnikov  and  Moessner  1998,  Cramer  1999,  Bhat- 
tacharya  and  Ranga  Rao  1976,  Hall  1992,  Rothenberg  1984,  van  der  Vaart  1998).  An 
important  tool  for  performing  these  refinements  is  provided  by  the  Edgeworth  expan- 
sion (Edgeworth  1905,  Edgeworth  1907),  which  approximates  the  distribution  of  the 
statistics  of  interest  around  the  limit  distribution  (often  the  normal  distribution)  by 
a  combination  of  Hermite  polynomials,  with  coefficients  defined  in  terms  of  moments. 
Inverting  the  expansion  yields  a  related  higher  order  approximation,  the  Cornish-Fisher 
expansion  (Cornish  and  Fisher  1938,  Fisher  and  Cornish  1960),  to  the  quantiles  of  the 
statistic  around  the  quantiles  of  the  limiting  distribution. 

One  of  the  important  shortcomings  of  either  Edgeworth  or  Cornish-Fisher  expansions 
is  that  the  resulting  approximations  to  the  distribution  and  quantile  functions  are  not 
monotonia,  which  violates  an  obvious  monotonicity  requirement.  This  comes  from  the 
fact  that  the  polynomials  involved  in  the  expansion  are  not  monotone.  Here  we  propose 
to  use  a  procedure,  called  the  rearrangement,  to  restore  the  monotonicity  of  the  approx- 
imations and,  perhaps  more  importantly,  to  improve  the  estimation  properties  of  these 
approximations.  The  resulting  improvement  is  due  to  the  fact  that  the  rearrangement 
necessarily  brings  the  non-monotone  approximations  closer  to  to  the  true  monotone 
target  function. 

The  main  findings  of  the  paper  can  be  illustrated  through  a  single  picture  given  as 
Figure  1.  In  that  picture,  we  plot  the  true  distribution  function  of  a  sample  mean  X 
based  on  a  small  sample,  a  third  order  Edgeworth  approximation  to  that  distribution, 
and  a  rearrangement  of  this  third  order  approximation.  We  see  that  the  Edgeworth 
approximation  is  sharply  non-monotone  and  provides  a  rather  poor  approximation  to 
the  distribution  function.  The  rearrangement  merely  sorts  the  value  of  the  approximate 
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Edgeworth  Expansion:  lognormal  (n  =  5) 


P(y)  (Edgeworth) 
F(y)  (Rearranged) 
F*(y)(True-MC) 


— r- 

-5 


Figure  1.  Distribution  function  for  the  standarized  sample  mean  of 
log- normal  random  variables  (sample  size  5),  the  third  order  Edgeworth 
approximation,  and  the  rearranged  third  order  Edgeworth  approximation. 


distribution  function  in  an  increasing  order.  One  can  see  that  the  rearranged  approx- 
imation, in  addition  to  being  monotonic,  is  a  much  better  approximation  to  the  true 
function  than  the  original  approximation. 

We  organize  the  rest  of  the  paper  as  follows.  In  Section  2,  we  describe  the  rearrange- 
ment and  qualify  the  approximation  property  it  provides  for  monotonic  functions.  In 
section  3,  we  introduce  the  rearranged  Edgeworth-Cornish-Fisher  expansions  and  ex- 
plain how  these  produce  better  estimates  of  distributions  and  quantiles  of  statistics.  In 
Section  4,  we  illustrate  the  procedure  with  several  additional  examples. 


4  improving  edgeworth  approximations 

2.  Improving  Approximations  of  Monotone  Functions  by  Rearrangement 

In  what  follows,  let  A'  be  a  compact  interval.  We  first  consider  an  interval  of  the  form 
X  =  [0, 1].  Let  f{x)  be  a  measurable  function  mapping  X  to  K,  a  bounded  subset  of  M. 
Let  Ff{y)  =  f^  ^{f{u)  <  y}du  denote  the  distribution  function  of  f{X)  when  X  follows 
the  uniform  distribution  on  [0, 1].  Let 

r{x)  :=  Qf{x)  :=  inf  {y  €  R  :  Ff{y)  >  x} 

be  the  quantile  function  of  Ff{y).  Thus, 


f*{x)  :=  inf  <^  y  e 


/  l{f{u)  <  y}du 
Jx 


>  X 


This  function  /*  is  called  the  increasing  rearrangement  of  the  function  /.  The  rearrange- 
ment is  a  tool  that  is  extensively  used  in  functional  analysis  and  optimal  transportation 
(see  e.g.  Hardy,  Littlewood,  and  Polya  (1952)  and  Villani  (2003).)  It  originates  in  the 
work  of  Chebyshev,  who  used  it  to  prove  a  set  of  inequalities  (Bronshtein,  Semendyayev, 
Musiol,  Muehlig,  and  Miihlig  2004).  Here  we  use  this  tool  to  improve  approximations 
of  monotone  functions,  such  as  the  Edgeworth-Cornish-Fisher  approximations  to  the 
distribution  and  quantile  functions  of  the  sample  mean. 

The  rearrangement  operator  simply  transforms  a  function  /  to  its  quantile  function 
/*.  That  is,  x  >-^  f*{x)  is  the  quantile  function  of  the  random  variable  f{X)  when 
X  ~  f/(0, 1).  Another  convenient  way  to  think  of  the  rearrangement  is  as  a  sorting 
operation:  Given  values  of  the  function  f{x)  evaluated  at  x  in  a  fine  enough  mesh  of 
equidistant  points,  we  simply  sort  the  values  in  an  increasing  order.  The  function  created 
in  this  way  is  the  rearrangement  of  /.  > 

Finally,  if -Y  is  of  the  form  [a,b],  let  x(a;)  =  {x-a)/{b-a)  G  [0, 1],  x{x)  =  a+{b  —  a)x  G 
[a,  6],  and  /*  be  the  rearrangement  of  the  function  f{x)  =  f{x{x))  defined  on  A^  =  [0, 1]. 
Then,  the  rearrangement  of  /  is  defined  as 

r{x):=r{x{x)). 
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The  following  result  is  detailed  in  Chernozhukov,  Fernandez-Val,  and  Galichon  (2006). 

Proposition  1  (Improving  Approximation  of  Monotone  Functions).  Let  fo  :  X  = 
[a,  b]  -^  K  be  a  weakly  increasing  measurable  function  in  x,  where  K  is  a  bounded 
subset  of  R.  This  is  the  target  function  that  we  want  to  approximate.  Let  f  :  X  -^  K 
be  another  measurable  function,  an  initial  approximation  or  an  estimate  of  the  target 
function  /q. 

1.  For  any  p  G  [1,  oo],  the  rearrangement  of  f,  denoted  f*,  weakly  reduces  the  estimation 
error: 


X 


rix)  - 

-  fo{x) 

p 
dx 

i/p 
< 

[L 

fix)  -  foix] 


ll/p 


dx 


(2.1) 


2.  Suppose  that  there  exist  regions  Aq  and  Xq,  each  of  measure  greater  than  5  >  Q,  such 
that  for  all  x  E  Xq  and  x'  G  Xq  we  have  that  (i)  x'  >  x,  (ii)  f{x)  >  f{x')  +  e,  and  (Hi) 
fo{x')  >  fo{x)  +  e,  for  some  e  >  0.  Then  the  gain  in  the  quality  of  approximation  is 
strict  for  p  €  (l,oo).  Namely,  for  any  p  G  [l,oo], 


1 


nx)  -  foix) 


dx 

1/) 

0 

< 

\l 

Jx 

fix)  -  foix)     dx  -  Sxrjp 


i/p 


(2.2) 


where  rjp  =  mi{\v  —  t'\^  +  1^'  —  t\^  -\v  —  t|P  -  \v'  —  t'\^}  and  rjp  >  0  forp  G  (l,C)o),  with 
the  infimum  taken  over  all  v,  v',  t,  t'  in  the  set  K  such  that  v'  >v  +  e  and  t'  >t  +  e;  and 
5;,  =  S/ib-a). 

Corollary  1  (Strict  Improvement).  //  the  target  function  /o  is  increasing  over  X  and 
f  is  decreasing  over  a  subset  of  X  that  has  positive  measure,  then  the  improvement  in 
Lp  norm,  forp  G  (l,oo),  is  necessarily  strict. 

The  first  part  of  the  proposition  states  the  weak  inequality  (2.1),  and  the  second 
part  states  the  strict  inequality  (2.2).  As  an  implication,  the  Corollary  states  that  the 
inequality  is  strict  for  p  G  (1,  oo)  if  the  original  estimate  fix)  is  decreasing  on  a  subset 
of  X  having  positive  measure,  while  the  target  function  foix)  is  increasing  on  X  (by 
increasing,  we  mean  strictly  increasing  throughout).  Of  course,  if  foix)  is  constant,  then 
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the  inequality  (2.1)  becomes  an  equality,  as  the  distribution  of  the  rearranged  function 
/*  is  the  same  as  the  distribution  of  the  original  function  /,  that  is  Ft,  =  Ft. 

This  proposition  establishes  that  the  rearranged  estimate  /*  has  a  smaller  estimation 
error  in  the  Lp  norm  than  the  original  estimate  whenever  the  latter  is  not  monotone. 
This  is  a  very  useful  and  generally  applicable  property  that  is  independent  of  the  sample 
size  and  of  the  way  the  original  approximation  /  to  /o  is  obtained. 

Remark  1.  An  indirect  proof  of  the  weak  inequality  (2.1)  is  a  simple  hut  important 
consequence  of  the  following  classical  inequality  due  to  Lorentz  (1953):  Let  q  and  g  he 
two  functions  mapping  X  to  K ,  a  hounded  suhset  of  R.  Let  q*  and  g*  denote  their 
corresponding  increasing  rearrangements.   Then, 


L 


L{q*{x),g*{x),x)dx  <    /   L{q{x),g{x),x)dx, 

Jx 


IX 

for  any  submodular  discrepancy  function  L  :  R^  >-^  M..  Set  q{x)  =  f{x),  q*{x)  =  f*{x), 
g{x)  =  fo{x),  and  g*{x)  =  /o(a:).  Now,  note  that  in  our  case  /o(x)  =  fo{x)  almost 
everywhere,  that  is,  the  target  function  is  its  own  rearrangement.  Moreover,  L{v,w,x)  = 
jiw  —  f  1^  is  suhmodular  for  p  G  [1,  oo) .  This  proves  the  first  part  of  the  proposition  ahove. 
For  p  =  oo,  the  first  part  follows  hy  taking  the  limit  as  p  —>  og.  In  the  Appendix, 
for  completeness,  we  restate  the  proof  of  Chernozhukov,  Fernandez- Val,  and  Galichon 
(2006)  of  the  strong  inequality  (2.2)  as  well  as  the  direct  proof  of  the  weak  inequality 
(2.1). 

Remark  2.  The  following  immediate  implication  of  the  ahove  finite-sample  result  is  also 
worth  emphasizing:  The  rearranged  estimate  f*  inherits  the  Lp  rates  of  convergence  from 
the  original  estimates  f.  For  p  G  [l,oo],  ^/A„  =  [/^  \fo{x)  —  f{x)\PduY^^  =  Op{an)  for 
some  sequence  of  constants  On,  then  [J^  \fo{x)  -  f*{x)\PduY^P  <  K  =  Op{an).  However, 
while  the  rate  is  the  same,  the  error  itself  is  smaller. 

Remark  3.  Finally,  one  can  also  consider  weighted  rearrangements  that  can  accentuate 
the  quality  of  approximation  in  various  areas.  Indeed,  consider  an  ahsolutely  continuous 
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distribution  function  A  on  X  =  [a,  b],  then  we  have  that  for  (/  o  A  ^)*{u)  denoting  the 
rearrangement  of  u  h->-  f[A~^{u)) 


[      (7oA-i)*(n)-/o(A-^ 

J[o,i] 


u)) 


ll/p 


du 


< 


f      f{A-\u))  -  f,{A-\u)) 
U[o,i] 


du 


i/p 


(2.3) 


or,  equivalently,  by  a  change  of  variable,  setting 

mx)  =  {foA-'r{A{x)) 

we  have  that 


Jx 


x)  -  fo{x)    dA{x) 


i/p 


< 


Ux 


m  -  foix) 


ii/p 


dAix) 


(2.4) 


Thus,  the  function  x  <—>■  /X(^)  ^■^  ^^^  weighted  rearrangement  that  provides  improvements 
in  the  quality  of  approximation  in  the  norm  that  is  weighted  according  to  the  distribution 
function  A. 

In  the  next  section,  we  apply  rearrangements  to  improve  the  Edgeworth-Cornish- 
Fisher  and  related  approximations  to  distribution  and  quantile  functions. 

3.  Improving  Edgeworth-Cornish-Fisher  and  Related  expansions 

3.1.  Improving  Quantile  Approximations  by  Rearrangement.  We  first  consider 
the  quantile  case.  Let  Qn  be  the  quantile  function  of  a  statistic  X„,  i.e. 

Qn{u)  =  inf{x  e  K  :  Pr[Xn  <  x]  >  u}, 

which  we  assume  to  be  strictly  increasing.  Let  Qn  be  an  approximation  to  the  quantile 
function  Qn  satisfying  the  following  relation  on  an  interval  Un  =  [e„,  1  —  £n]  Q  [0, 1]: 


Qn{u)  =  Qn{u)  +  en{u),    |e„(it)|  <  a„,     for  ah  u  G  Un- 


(3.1) 


The  leading  example  of  such  an  approximation  is  the  inverse  Edgeworth,  or  Cornish- 
Fisher,  approximation  to  the  quantile  function  of  a  sample  mean.    If  X„  is  is  a  stan- 


dardized sample  mean,  X„  =  Yl^^ii^i  ~  E[Yi\)/ ^/Var{Yi)  based  on  an  i.i.d.    sample 
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(Vi,  ...,Yn),  then  we  have  the  following  J-th  order  approximation 

Qn{u)  =  Qn{u)  +  eniu) 

Qn{u)  =  R,{^-\u))  +  R2{^~\u))ln"^  +  ...  +  Rj{^-\u))/n^'-'y\ 

(3.2) 
\^n{u)\  <  Cn  "'/^     for  all  ueUn  =  [e„,  1  -  e„], 

for  some  e„  \  0  and  C  >  0, 

provided  that  a  set  of  regularity  conditions,  specified  in,  e.g.,  Zolotarev  (1991),  hold. 
Here  $  and  $~^  denote  the  distribution  function  and  quantile  function  of  a  standard 
normal  random  variable.  The  first  three  terms  of  the  approximation  are  given  by  the 
polynomials, 

Ri{z)  =  z, 

R2{z)  =  X{z'-  -  l)/6,  (3.3) 

Rsiz)  =  {3k{z^  -  3z)  -  2X^{2z^  -  5z))/72, 

where  A  is  the  skewness  and  k  is  the  kurtosis  of  the  random  variable  Y.  The  Cornish- 
Fisher  approximation  is  one  of  the  central  approximations  of  the  asymptotic  statistics. 
Unfortunately,  inspection  of  the  expression  for  polynomials  (3.3)  reveals  that  this  ap- 
proximation does  not  generally  deliver  a  monotone  estimate  of  the  quantile  function. 
This  shortcoming  has  been  pointed  and  discussed  in  detail  e.g.  by  Hall  (1992).  The 
nature  of  the  polynomials  constructed  is  such  that  there  always  exists  a  large  enough 
range  Un  over  which  the  Cornish-Fisher  approximation  is  not  monotone,  cf.  Hall  (1992). 
As  an  example,  in  the  case  of  the  second  order  approximation  ( J  =  2)  we  have  that  for 
A  <  0 

Qn{u)  \  -oo,  as  u  y  I,  '  (3.4) 

that  is,  the  Cornish-Fisher  "quantile"  function  Qn  is  decreasing  far  enough  in  the  tails. 
This  example  merely  suggests  a  potential  problem  that  may  apply  to  practically  relevant 
ranges  of  probability  indices  u.  Indeed,  specific  numerical  examples  given  below  show 
that  in  small  samples  the  non-monotonicity  can  occur  in  practically  relevant  ranges.  Of 
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course,  in  sufficiently  large  samples,  the  regions  of  non-monotonicity  are  squeezed  quite 
far  into  the  tails. 

Let  Q^  be  the  rearrangement  of  Q„.  Then  we  have  that  for  any  p  E  [1,  oo],  the  rear- 
ranged quantile  function  reduces  the  approximation  error  of  the  original  approximation: 


J 


Q:{u)  -  Qn{u) 


du 


i/p 


< 


Un 


Qn{u)  -  Qn{u) 


du 


i/p 


<  (1  -  2e„)a„,      (3.5) 


with  the  first  inequality  holding  strictly  for  p  G  (1,cxd)  whenever  Qn  is  decreasing  on  a 
region  of  Un  of  positive  measure.  We  can  give  the  following  probabilistic  interpretation 
to  this  result.  Under  condition  (3.1),  there  exists  a  variable  U  =  F„(X„)  such  that  both 
the  stochastic  expansion 


and  the  expansion 


Xn  =  Qn{U)  +  Op{an), 


Xn^Q:{U)  +  Op{an), 


(3.6) 


(3.7) 


hold,-^  but  the  variable  Qn{U)  in  (3.7)  is  a  better  coupling  to  the  statistic  X„  than  Qn{U) 
in  (3.6),  in  the  following  sense:  For  each  p  G  [1,  oo], 


[E1„[X„  -  Qn{U)YY'^  <  [EUXn  -  Qn(f/)]1'/^ 


(3.8) 


where  1„  =  1{U  G  Un}-  Indeed,  property  (3.8)  immediately  follows  from  (3.5). 

The  above  improvements  apply  in  the  context  of  the  sample  mean  X„.  In  this  case, 
the  probabilistic  interpretation  above  is  directly  connected  to  the  higher  order  central 
limit  theorem  of  Zolotarev  (1991),  which  states  that  under  (3.2),  we  have  the  following 
higher-order  probabilistic  central  limit  theorem, 


Xn  =  Qn{U)  +  Op{n 


-J/2) 


(3.9) 


^QniU)  is  defined  only  on  Un,  so  we  can  set  Qn{U)  =  Qn{U)  outside  Un,  if  needed.    Of  course, 
U  ^Un  with  probability  going  to  zero. 
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The  term  Qn{U)  is  Zolotarev's  high-order  refinement  over  the  first  order  normal  terms 
$~^(f/).  Sun,  Loader,  and  McCormick  (2000)  employ  an  analogous  higher-order  proba- 
bilistic central  limit  theorem  to  improve  the  construction  of  confidence  intervals. 

The  application  of  the  rearrangement  to  the  Zolotarev's  term  in  fact  delivers  a  clear 
improvement  in  the  sense  that  it  also  leads  to  a  probabilistic  higher  order  central  limit 
theorem 

Xr.  =  Q*^{U)  +  0,{n-"^),  (3.10) 

but  the  leading  term  Q^iU)  is  closer  to  X„  than  the  Zolotarev's  term  Qn{U),  in  the 
sense  of  (3.8). 

We  summarize  the  above  discussion  into  a  formal  proposition. 

Proposition  2.  If  expansion  (3.1)  holds,  then  the  improvement  (3.5)  necessarily  holds. 
The  improvement  is  necessarily  strict  if  Qn  is  decreasing  over  a  region  ofUn  that  has  a 
positive  measure.  In  particular,  this  improvement  property  applies  to  the  inverse  Edge- 
worth  approximation  defined  in  (3.2)  to  the  quantile  function  of  the  sample  mean. 

3.2.  Improving  Distributional  Approximations  by  Rearrangement.  We  next 
consider  distribution  functions.  Let  Fn{x)  be  the  distribution  function  of  a  statistic 
Xn,  and  Fn{x)  be  the  approximation  to  this  distribution  such  that  the  following  relation 
holds: 

Fn{x)  =^  Fn{x)  +  en{x) ,    |e„(x) |  <  a„,  for  aU  x  G -^n,  (3.11) 

where  A'„  =  [— 6„,  6„]  is  an  interval  in  M  for  some  sequence  of  positive  numbers  6„  possibly 
growing  to  infinity. 

The  leading  example  of  such  an  approximation  is  the  Edgeworth  expansion  of  the 
distribution  function  of  a  sample  mean.  If  Xn  is  a  standardized  sample  mean,  Xn  = 
IZr=i(^«  ~  E\yi]) / y/V ar{Yi)  based  on  an  i.i.d.    sample  (Yl,  ...,y„),  then  we  have  the 
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following  J-th  order  approximation 

Fn{z)  =  Fniz)  +  eniz) 

Fn{x)  =  P,{x)  +  P2{x)/n"^  +  ...  +  Pj{x)/n('-^^'\ 

\^n{x)\  <  Cn~'^^'^,     for  all  x  E  Xn, 
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(3.12) 


for  some  C  >  0,  provided  that  a  set  of  regularity  conditions,  specified  in  e.g.  Hall  (1992), 
hold.  The  first  three  terms  of  the  approximation  are  given  by 


P,ix)  =  ^x), 

P2{x)  =  -X{x^  -  l)(/.(x)/6, 

P^(x)  =  -{3k{x^  -  3x)  +  \^{x^  -  lOx^  +  15x))(t){x)/72, 


(3.13) 


where  <l>  and  0  denote  the  distribution  function  and  density  function  of  a  standard 
normal  random  variable,  and  A  and  k  are  the  skewness  and  kurtosis  of  the  random 
variable  Y.  The  Edgeworth  approximation  is  one  of  the  central  approximations  of  the 
asymptotic  statistics.  Unfortunately,  like  the  Cornish-Fisher  expansion  it  generally  does 
not  provide  a  monotone  estimate  of  the  distribution  function.  This  shortcoming  has 
been  pointed  and  discussed  in  detail  by  (Barton  and  Dennis  1952,  Draper  and  Tierney 
1972,  Sargan  1976,  Balitskaya  and  Zolotuhina  1988),  among  others. 

Let  F*  be  the  rearrangement  of  F„.  Then,  we  have  that  for  any  p  G  [l,oo],  the 
rearranged  Edgeworth  approximation  reduces  the  approximation  error  of  the  original 
Edgeworth  approximation: 


Jx„ 


F:ix)-Fnix) 


dx 


\Ip 


< 


^n 


Fn{x)  -  Fn{x) 


dx 


i/v 


<  26„a„, 


(3.14) 


with  the  first  inequality  holding  strictly  for  p  £  (1,  oo)  whenever  F„  is  decreasing  on  a 
region  of  Xn  of  positive  measure. 

Proposition   3.   //  expansion  (3.11)  holds,   then  the  improvement  (3.14)  necessarily 
holds.     The  improvement  is  necessarily  strict  if  Fn  is  decreasing  over  a  region  of  Xn 
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that  has  a  positive  measure.  In  particular,  this  improvement  property  applies  to  Edge- 
worth  approximation  defined  in  (3.12)  to  the  distribution  function  of  the  sample  mean. 

3.3.  Weighted  Recirrangement  of  Cornish-Fisher  and  Edgeworth  Expansions. 

In  some  cases,  it  could  be  worthwhile  to  weigh  different  areas  of  support  differently 
than  the  Lebesgue  (fiat)  weighting  prescribes.  For  example,  it  might  be  desirable  to 
rearrange  F„  using  F„  as  a  weight  measure.  Indeed,  using  F„  as  a  weight,  we  obtain  a 
better  coupling  to  the  P-value:  P  =  Fn{Xn)  (in  this  quantity,  X„  is  drawn  according 
to  the  true  F„).  Using  such  weight  will  provide  a  probabilistic  interpretation  for  the 
rearranged  Edgeworth  expansion,  in  analogy  to  the  probabilistic  interpretation  for  the 
rearranged  Cornish-Fisher  expansion.  Since  the  weight  is  not  available  we  may  use  the 
standard  normal  measure  $  as  the  weight  measure  instead.  We  may  also  construct  an 
initial  rearrangement  with  the  Lebesgue  weight,  and  use  it  as  weight  itself  in  a  further 
weighted  rearrangement  (and  even  iterate  in  this  fashion).  Using  non-Lebesgue  weights 
may  also  be  desirable  when  we  want  the  improved  approximations  to  weight  the  tails 
more  heavily.  Whatever  the  reason  might  be  for  further  non-Lebesgue  weighting,  we 
have  the  following  properties,  which  follow  immediately  in  view  of  Remark  3. 

Let  A  be  a  distribution  function  that  admits  a  positive  density  with  respect  to  the 
Lebesgue  measure  on  the  region  Un  =  [e„,  1  —  e„]  for  the  quantile  case  and  region  Xn  = 
[— 6„,6„]  for  the  distribution  case.  Then  if  (3.1)  holds,  the  A- weighted  rearrangement 
Q*  ^  of  the  function  Q  satisfies 


[/ 


Q:,^{u)-  Qniu)      dK{u) 


\/p 


< 


J 


Qn{u)  -  Qn{u)      dA{u) 


1/p 


<      (A[l  -  Sn]  -  A[£n])an, 


(3.15) 
(3.16) 


where  the  first  equality  holds  strictly  when  Q  is  decreasing  on  a  subset  of  positive  A- 
measure.   Furthermore,  if  (3.11)  holds,  then  the  A- weighted  rearrangement  F*^  of  the 
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function  F  satisfies 


Jxr, 


F:Jx)-Fr,{x)    dA{x) 


1/p 


<  /      F„(x)-F„(x)     dA{x) 

-JXn 

<  {A[bn]  -  A[-bn])an 


1/p 


(3.17) 
(3.18) 


4.  Numerical  Examples 

In  addition  to  the  lognormal  example  given  in  the  introduction,  we  use  the  gamma  dis- 
tribution to  illustrate  the  improvements  that  the  rearrangement  provides.  Let  (Yi, ...,  F„) 
be  an  i.i.d.  sequence  of  Gamma(l/16,16)  random  variables.  The  statistic  of  interest  is 
the  standardized  sample  mean  X„  =  l^"=i(^i  ~  E[Yi\)/y/Var{Yi).  We  consider  samples 
of  sizes  n  =  4, 8, 16,  and  32.  In  this  example,  the  distribution  function  F„  and  quantile 
function  Qn  of  the  sample  mean  X„  are  available  in  a  closed  form,  making  it  easy  to  com- 
pare them  to  the  Edgeworth  approximation  F„  and  the  Cornish-Fisher  approximation 
Qn,  as  well  as  to  the  rearranged  Edgeworth  approximation  F*  and  the  the  rearranged 
Cornish-Fisher  approximation  Q^.  For  the  Edgeworth  and  Cornish-Fisher  approxima- 
tions, as  defined  in  the  previous  section,  we  consider  the  third  order  expansions,  that  is 
we  set  J  =  3. 

Figure  2  compares  the  true  distribution  function  F„,  the  Edgeworth  approximation 
F„,  and  the  rearranged  Edgeworth  approximation  F*.  We  see  that  the  rearranged  Edge- 
worth  approximation  not  only  fixes  the  monotonicity  problem,  but  also  consistently  does 
a  better  job  at  approximating  the  true  distribution  than  the  Edgeworth  approximation. 
Table  1  further  supports  this  point  by  presenting  the  numerical  results  for  the  Lp  ap- 
proximation errors,  calculated  according  to  the  formulas  given  in  the  previous  section. 
We  see  that  the  rearrangement  reduces  the  approximation  error  quite  substantially  in 
most  cases. 

Figure  3  compares  the  true  quantile  function  Qn,  the  Cornish-Fisher  approximation 
Qn,  and  the  rearranged  Cornish-Fisher  approximation  Q*.    Here  too  we  see  that  the 
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rearrangement  not  only  fixes  the  non-monotonicity  problem,  but  also  brings  the  approx- 
imation closer  to  the  truth.  Table  2  further  supports  this  point  numerically,  showing 
that  the  rearrangement  reduces  the  Lp  approximation  error  quite  substantially  in  most 
cases. 

5.  Conclusion 

In  this  paper,  we  have  applied  the  rearrangement  procedure  to  monotonize  Edgeworth 
and  Cornish-Fisher  expansions  and  other  related  expansions  of  distribution  and  quantile 
functions.  The  benefits  of  doing  so  are  twofold.  First,  we  have  obtained  estimates 
of  the  distribution  and  quantile  curves  of  the  statistics  of  interest  which  satisfy  the 
logical  monotonicity  restriction,  unlike  those  directly  given  by  the  truncation  of  the 
series  expansions.  Second,  we  have  shown  that  doing  so  resulted  in  better  approximation 
properties. 

Appendix  A.  Proof  of  Proposition  1 

We  consider  the  case  where  X  =  [0, 1]  only,  as  the  more  general  intervals  can  be  dealt 
similarly.  The  first  part  establishes  the  weak  inequality,  following  in  part  the  strategy 
in  Lorentz's  (1953)  proof.  The  proof  focuses  directly  on  obtaining  the  result  stated  in 
the  proposition.  The  second  part  establishes  the  strong  inequality. 

Proof  of  Part  1.  We  assume  at  first  that  the  functions  /(•)  and  /o(-)  are  simple 
functions,  constant  on  intervals  ((s  —  l)/r,  s/r],  s  =  1,  ...,r.  For  any  simple  /(•)  with  r 
steps,  let  /  denote  the  r- vector  with  the  s-th  element,  denoted  /s,  equal  to  the  value  of 
/(•)  on  the  s-th  interval.  Let  us  define  the  sorting  operator  S{f)  as  follows:  Let  i  be  an 
integer  in  1, ...,  r  such  that  f(,  >  fm  for  some  m  >  I.  If  f  does  not  exist,  set  S{f)  =  f.  If 
i  exists,  set  S{f)  to  be  a  r-vector  with  the  i-th.  element  equal  to  fm,  the  m-th  element 
equal  to  fi,  and  all  other  elements  equal  to  the  corresponding  elements  of  /.  For  any 
submodular  function  L  :  E^  — ^  E+,  by  /^  >  /m,  fom  >  foe  and  the  definition  of  the 
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submodularity, 

Lifrm  foe)  +  L{fe,  /om)  <  L{fe,  foe)  +  L{fm,  fom)- 

Therefore,  we  conclude  that 

/  L{S{f){x),fo{x))dx  <  [  L{fix)Jo{x))dx,  (A.l) 

Jx  Jx 

using  that  we  integrate  simple  functions. 

Applying  the  sorting  operation  a  sufficient  finite  number  of  times  to  /,  we  obtain  a 
completely  sorted,  that  is,  rearranged,  vector  /*.  Thus,  we  can  express  /*  as  a  finite 
composition  /*  =  5  o  ...  o  S{f)  .  By  repeating  the  argument  above,  each  composition 
weakly  reduces  the  approximation  error.  Therefore, 

/  L(f  (x),  fo{x))dx  <    I  L(^.^(/),  Ux))dx  <    f  Lifix),  fo{x))dx.       (A.2) 
Jx  Jx       £  .,  ^  Jx 


Furthermore,  this  inequality  is  extended  to  general  measurable  functions  /(•)  and  /o(-) 
mapping  X  to  K  by  taking  a  sequence  of  bounded  simple  functions  p^\-)  and  /o '(•) 
converging  to  /(•)  and  /o(-)  almost  everywhere  as  r  -^  oo.  The  almost  everywhere 
convergence  of  /^"^H")  to  /(")  implies  the  almost  everywhere  convergence  of  its  quantile 
function  /**'"^(-)  to  the  quantile  function  of  the  limit,  /*(■)■  Since  inequality  (A.2)  holds 
along  the  sequence,  the  dominated  convergence  theorem  imphes  that  (A.2)  also  holds 
for  the  general  case.  D 

Proof  of  Part  2.  Let  us  first  consider  the  case  of  simple  functions,  as  defined  in  Part 
1.  We  take  the  functions  to  satisfy  the  following  hypotheses:  there  exist  regions  Aq  and 
A'q,  each  of  measure  greater  than  5  >  0,  such  that  for  all  x  E  Xq  and  x'  £  A'q,  we  have 
that  (i)  x'  >  x,  (u)  f{x)  >  f[x')  +  e,  and  (in)  fo{x')  >  fo{x)  +  e,  for  e  >  0  specified  in 
the  proposition.  For  any  strictly  submodular  function  L  :  R^  ^  M+  we  have  that 

T]  =  inf{L(i/,  t)  +  L{v,  t')  -  L{v,  t)  -  L{v',  t')}  >  0, 

where  the  infimum  is  taken  over  all  v,v',t,t'  in  the  set  K  such  that  v'  >  v  +  e  and 
t'  >t  +  e. 
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We  can  begin  sorting  by  exchanging  an  element  f{x),  x  G  ^o,  of  r- vector  /  with  an 
element  f{x'),  x'  €  Xq,  of  r- vector  /.  This  induces  a  sorting  gain  of  at  least  r]  times  1/r. 
The  total  mass  of  points  that  can  be  sorted  in  this  way  is  at  least  5.  We  then  proceed  to 
sort  all  of  these  points  in  this  way,  and  then  continue  with  the  sorting  of  other  points. 
After  the  sorting  is  completed,  the  total  gain  from  sorting  is  at  least  5r].  That  is, 

/  Lirix),  fo{x))dx  <   f  L{f{x),  h{x))dx  -  57). 
Jx  Jx 

We  then  extend  this  inequality  to  the  general  measurable  functions  exactly  as  in  the 

proof  of  part  one.  D 
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Table  1.  Estimation  errors  for  approximations  to  the  distribution  func- 
tion of  the  standarized  sample  mean  from  a  Gamma(l/16,  16)  population. 


First  Order 

Third  Order 

TO- 

Rearranged 

Ratio  (TO/RTO) 

A.  n 

=  4 

Li 

0.07 

0.05 

0.02 

0.38 

L2 

0.10 

0.06 

0.03 

0.45 

Ls 

0.13 

0.07 

0.04 

0.62 

U 

0.15 

0.08 

0.07 

0.81 

Loo 

0.30 

0.48 

B.  n 

0.48 
=  8 

1.00 

Li 

0.06 

0.03 

0.01 

0.45 

L2 

0.08 

0.04 

0.02 

0.63 

L, 

0.10 

0.05 

0.04 

0.85 

U 

0.11 

0.06 

0.06 

0.96 

Loo 

0.23 

0.28 

C.  n-- 

0.28 
=  16 

1.00 

Li 

0.05 

0.01 

0.01 

0.97 

L2 

0.06 

0.02 

0.02 

0.99 

Ls 

0.08 

0.03 

0.03 

1.00 

L, 

0.08 

0.04 

0.04 

1.00 

Loo 

0.15 

0.11 

D.  n 

0.11 
=  32 

1.00 

Li 

0.04 

0.01 

0.01 

1.00 

L2 

0.05 

0.01 

0.01 

1.00 

Ls 

0.06 

0.01 

0.01 

1.00 

L, 

0.06 

0.01 

0.01 

1.00 

Loo 

0.09 

0.03 

0.03 

1.00 
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Table  2.  Estimation  errors  for  approximations  to  the  distribution  func- 
tion of  the  standarized  sample  mean  from  a  Gamma(l/16,  16)  population. 


First  Order 

Third  Order 

TO- 

Rearranged 

Ratio  (TO/RTO) 

A.  n 

=  4 

Li 

0.50 

0.24 

0.09 

0.39 

L2 

0.59 

0.32 

0.11 

0.35 

Ls 

0.69 

0.42 

0.13 

0.31 

U 

0.78 

0.52 

0.15 

0.29 

L^ 

2.04 

1.53 

B.  n 

0.49 

=  8 

0.32 

L, 

0.39 

0.08 

0.03 

0.37 

L2 

0.47 

0.11 

0.04 

0.35 

Ls 

0.56 

0.16 

0.05 

0.32 

U 

0.65 

0.21 

0.06 

0.30 

Loo 

1.66 

0.67 

C.  n- 

0.22 
=  16 

0.33 

L, 

0.28 

0.02 

0.02 

0.97 

L2 

0.35 

0.04 

0.03 

0.84 

L, 

0.43 

0.05 

0.04 

0.71 

L, 

0.51 

0.07 

0.04 

0.63 

L^ 

1.34 

0.24 

D.  n 

0.10 
=  32 

0.44 

L, 

0.20 

0.01 

0.01 

1.00 

L2 

0.26 

0.01 

0.01 

1.00 

Ls 

0.31 

0.02 

0.02 

1.00 

L, 

0.37 

0.02 

0.02 

1.00 

Loo 

1.02 

0.07 

0.07 

1.00 
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Figure  2.  Distribution  Functions,  First  Order  Approximations,  Third 
Order  Approximations,  and  Rearrangements  for  the  standarized  sample 
mean  from  a  Gamma(l/16,  16)  population. 
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Figure  3.  Quantile  Functions,  First  Order  Approximations,  Third  Order 
Approximations,  and  Rearrangements  for  the  standarized  sample  mean 
from  a  Gamma(l/16,  16)  population. 


